Functional requirements for reward-modulated spike-timing-dependent plasticity.

نویسندگان

Nicolas Frémaux

Henning Sprekeler

Wulfram Gerstner

چکیده

Recent experiments have shown that spike-timing-dependent plasticity is influenced by neuromodulation. We derive theoretical conditions for successful learning of reward-related behavior for a large class of learning rules where Hebbian synaptic plasticity is conditioned on a global modulatory factor signaling reward. We show that all learning rules in this class can be separated into a term that captures the covariance of neuronal firing and reward and a second term that presents the influence of unsupervised learning. The unsupervised term, which is, in general, detrimental for reward-based learning, can be suppressed if the neuromodulatory signal encodes the difference between the reward and the expected reward-but only if the expected reward is calculated for each task and stimulus separately. If several tasks are to be learned simultaneously, the nervous system needs an internal critic that is able to predict the expected reward for arbitrary stimuli. We show that, with a critic, reward-modulated spike-timing-dependent plasticity is capable of learning motor trajectories with a temporal resolution of tens of milliseconds. The relation to temporal difference learning, the relevance of block-based learning paradigms, and the limitations of learning with a critic are discussed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Theoretical Analysis of Learning with Reward-Modulated Spike-Timing-Dependent Plasticity

Reward-modulated spike-timing-dependent plasticity (STDP) has recently emerged as a candidate for a learning rule that could explain how local learning rules at single synapses support behaviorally relevant adaptive changes in complex networks of spiking neurons. However the potential and limitations of this learning rule could so far only be tested through computer simulations. This article pr...

متن کامل

Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity

The persistent modification of synaptic efficacy as a function of the relative timing of pre- and postsynaptic spikes is a phenomenon known as spike-timing-dependent plasticity (STDP). Here we show that the modulation of STDP by a global reward signal leads to reinforcement learning. We first derive analytically learning rules involving reward-modulated spike-timing-dependent synaptic and intri...

متن کامل

Reward-modulated spike-timing-dependent plasticity with a dynamic spike timing rule and inhibitory plasticity

The viability of spike-timing-dependent plasticity (STDP) to explain learning processes is controversial, although recent developments of reward-modulated STDP (RM-STDP) models provide a plausible substrate. However, evidence has also emerged to show that rewards themselves can modify the STDP rule. In this modeling study, we use a dynamic STDP rule to show that such modification can lead to ne...

متن کامل

Spike timing dependent plasticity: mechanisms, significance, and controversies

Long-term modification of synaptic strength is one of the basic mechanisms of memory formation and activity-dependent refinement of neural circuits. This idea was purposed by Hebb to provide a basis for the formation of a cell assembly. Repetitive correlated activity of pre-synaptic and post-synaptic neurons can induce long-lasting synaptic strength modification, the direction and extent of whi...

متن کامل

A Learning Theory for Reward-Modulated Spike-Timing-Dependent Plasticity with Application to Biofeedback

Reward-modulated spike-timing-dependent plasticity (STDP) has recently emerged as a candidate for a learning rule that could explain how behaviorally relevant adaptive changes in complex networks of spiking neurons could be achieved in a self-organizing manner through local synaptic plasticity. However, the capabilities and limitations of this learning rule could so far only be tested through c...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

The Journal of neuroscience : the official journal of the Society for Neuroscience

دوره 30 40 شماره

صفحات -

تاریخ انتشار 2010

Functional requirements for reward-modulated spike-timing-dependent plasticity.

نویسندگان

چکیده

منابع مشابه

Theoretical Analysis of Learning with Reward-Modulated Spike-Timing-Dependent Plasticity

Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity

Reward-modulated spike-timing-dependent plasticity with a dynamic spike timing rule and inhibitory plasticity

Spike timing dependent plasticity: mechanisms, significance, and controversies

A Learning Theory for Reward-Modulated Spike-Timing-Dependent Plasticity with Application to Biofeedback

عنوان ژورنال:

اشتراک گذاری